Sustainable Carbon-Aware and Water-Efficient LLM Scheduling in Geo-Distributed Cloud Datacenters
Moore, Hayden, Qi, Sirui, Hogade, Ninad, Milojicic, Dejan, Bash, Cullen, Pasricha, Sudeep
In recent years, Large Language Models (LLMs) such as ChatGPT, CoPilot, and Gemini have been widely adopted in a variety of areas. As the use of LLMs continues to grow, many efforts have focused on reducing the massive training overheads of these models. But it is the environmental impact of handling user requests to LLMs that is increasingly becoming a concern. Recent studies estimate that the costs of operating LLMs in their inference phase can exceed training costs by 25x per year. As LLMs are queried incessantly, the cumulative carbon footprint of the operational phase has been shown to far exceed the footprint of the training phase. Further, estimates indicate that 500 ml of fresh water is expended for every 20-50 requests to LLMs during inference. To address these important sustainability issues with LLMs, we propose a novel framework called SLIT to co-optimize LLM quality of service (time-to-first-token), carbon emissions, water usage, and energy costs. The framework utilizes a machine learning (ML) based metaheuristic to enhance the sustainability of LLM hosting across geo-distributed cloud datacenters. Such a framework will become increasingly vital as LLMs proliferate.
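The abstract names four objectives that SLIT trades off when placing LLM requests across geo-distributed datacenters. As a minimal sketch of that idea, the snippet below scores candidate datacenters with a simple weighted sum over time-to-first-token, carbon intensity, water usage, and energy price; the datacenter names, metric values, and weights are illustrative assumptions, and a weighted-sum score stands in for the paper's ML-based metaheuristic, which is not detailed here.

```python
from dataclasses import dataclass

@dataclass
class Datacenter:
    name: str
    ttft_s: float            # expected time-to-first-token for this request (s)
    carbon_g_per_kwh: float  # grid carbon intensity (gCO2/kWh), assumed value
    water_l_per_kwh: float   # water consumed per kWh (L/kWh), assumed value
    price_per_kwh: float     # energy cost ($/kWh), assumed value

def score(dc: Datacenter, energy_kwh: float, w=(0.4, 0.3, 0.2, 0.1)) -> float:
    """Weighted-sum cost of serving one request at dc; lower is better.

    The weights are illustrative, not from the paper.
    """
    return (w[0] * dc.ttft_s
            + w[1] * dc.carbon_g_per_kwh * energy_kwh
            + w[2] * dc.water_l_per_kwh * energy_kwh
            + w[3] * dc.price_per_kwh * energy_kwh)

def place_request(datacenters, energy_kwh=0.002):
    """Route the request to the datacenter with the lowest combined cost."""
    return min(datacenters, key=lambda dc: score(dc, energy_kwh))

# Hypothetical candidate sites: one fast but carbon-heavy, one slower but cleaner.
dcs = [
    Datacenter("us-colorado", ttft_s=0.8, carbon_g_per_kwh=550,
               water_l_per_kwh=1.8, price_per_kwh=0.09),
    Datacenter("eu-west", ttft_s=1.2, carbon_g_per_kwh=220,
               water_l_per_kwh=1.1, price_per_kwh=0.14),
]
best = place_request(dcs)
```

With these made-up numbers, the cleaner European site wins despite its higher latency, illustrating how the weighting lets operators trade quality of service against sustainability.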
Bringing AI to the Edge
This year, U.S. rail carrier Amtrak will be installing two novel inspection gateways from Duos Technologies along its busy Northeast Corridor. The barn-like Duos structures straddle railway tracks; as passenger trains speed through at up to 125 miles per hour, 97 cameras and dozens of LED lights arrayed around the sides, top, and bottom of the tracks will capture thousands of high-resolution images of the railcars. These images are aggregated and processed on site in real time to present a complete, 360-degree, highly detailed view of the train. Artificial intelligence (AI) algorithms running on Nvidia GPUs will analyze the images locally; if the model flags a potential structural or mechanical flaw, train personnel will be notified in less than a minute. The Duos portal is one of many new examples of what is loosely categorized as edge AI, or the deployment and operation of AI models outside of massive cloud datacenters.
Radium looks to speed up AI and ML jobs in cloud datacenters
Today, Radium, a startup that aims to use artificial intelligence and machine learning to extract more computing power from cloud hardware, announced it was leaving stealth mode and deploying its solutions to cloud datacenters run by Cyxtera in Toronto, the New York and New Jersey metro area, and Silicon Valley. The main product, called Launchpad, lets users start and shut down projects on bare metal machines, eliminating the extra layers of hypervisors and virtualization software. Radium offered benchmark tests on machine learning jobs that showed speed increases ranging from 30% to 140%. "Our initial testing shows that bare metal servers offer a good cloud computing platform for the high-performance deep learning and inference workloads required for these types of applications," said Srinivasa Narasimhan, a professor at Carnegie Mellon's School of Computer Science, who has been working with the company to test its product. Many cloud products rely heavily on virtualization software layers, or "hypervisors," that allow one physical machine to simulate a variety of smaller machines that appear independent to users.
Cloud datacenters anticipated to become largely robot-dependent by 2025
In a strong endorsement of the value of artificial intelligence and machine learning, research firm Gartner predicts that half of cloud datacenters will be leveraging advanced robots by 2025. Gartner believes these AI-centered deployments will improve datacenter operating efficiency by 30%. So what role will robots play in cloud datacenters? Why are these robots considered so vital, and what motivates businesses to adopt them at such a robust pace? The typical workflow in a cloud datacenter comprises a host of different actions.